K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 55 | 81 | 87 | 91 | 94 |
1000 | 191 | 431 | 627 | 753 | 837 |
10000 | 726 | 2900 | 5154 | 6761 | 7816 |
100000 | 894 | 3672 | 7335 | 10579 | 12967 |
1000000 | 894 | 3672 | 7335 | 10579 | 12967 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings